Skip to content

Replace llvm Intrinsics with clang buildins#65

Merged
asroy merged 8 commits into
developfrom
xdlops_buildins
Feb 3, 2022
Merged

Replace llvm Intrinsics with clang buildins#65
asroy merged 8 commits into
developfrom
xdlops_buildins

Conversation

@zjing14
Copy link
Copy Markdown
Contributor

@zjing14 zjing14 commented Jan 11, 2022

No description provided.

@asroy
Copy link
Copy Markdown
Contributor

asroy commented Jan 27, 2022

Is this PR ready to review or still WIP?

@zjing14 zjing14 changed the title [WIP] Replace llvm Intrinsics with clang buildins Replace llvm Intrinsics with clang buildins Jan 27, 2022
@zjing14
Copy link
Copy Markdown
Contributor Author

zjing14 commented Jan 27, 2022

Now, the PR is ready.

@zjing14 zjing14 requested a review from asroy January 27, 2022 16:13
@asroy asroy merged commit 6d92959 into develop Feb 3, 2022
@illsilin illsilin deleted the xdlops_buildins branch December 7, 2023 18:39
carlushuang pushed a commit that referenced this pull request Jan 31, 2024
* add lse parameters to kernel

* add store lse in kernel

* add lse host ref and check result

* add parameter to control store lse or not

* fix kernel template kStoreLSE value

* move lse store to pipeline

* fix output err info

* fix output err info2

* change lse 4 dim to 3 dim

* mask storing lse in example

* remove divide in kernel

* remove pointer in function reference_batched_softmax

* set LSE is template parameter in FmhaFwdKernelSelector

* remove parameter stride_lse

* fix bug for using nullopt in function reference_batched_softmax

---------

Co-authored-by: letaoqin <letaoqin@amd.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants